^X--^plicing Factor 
The invention relates to proteins comprising splicing 

^ factor activity in plants. 

Alternative pre-mRNA-splicing is part of the 

5 expression program of a large number of genes in animals 
and plants. It allows the selection of different 
combinations of splice sites within a given pre-mRNA, 
generating structurally and functionally distinct protein 
isoforms (Breitbart et al. 1987; Manley and Tacke 1996; 

10 Caceres and Krainer, 1997). Several protein factors 

involved in the regulation of alternative splicing have 
been described, including a family of RNA-binding proteins 
containing arginine/serine-rich regions (SR proteins) (for 
reviews, see Fu 1995; Chabot 1996; Valcarcel and Green 

15 1995; Caceres and Krainer 1997) . SR proteins are highly 
conserved nuclear phosphoproteins that are members of a 
protein family and share a serine phospho-epitope 
recognized by the monoclonal antibody mAbl04 (Roth et al. 
1991) . They consist at least of one RNA-binding domain 

20 (RBD) [the typical RED or RNA recognition motif (RRM) 
domain of approximately 80 amino aicds] (Birney et al. 
1993) , and possess several serine/arginine (SR) dipeptides 
near their carboxy termini (Zahler et al. 1992) . In 
general, SR proteins are a defined subgroup of a large 

25 superfamily of nuclear proteins with RS-rich domains of 
variable sequence and position (Fu 1995) . 

The human SF2/ASF splicing factor, a prototype of an 
SR protein, is essential for the first cleavage step in 
pre-mRNA splicing (Krainer et al. 1990b; Ge et al. 1991) 

30 and can also determine in a concentration-dependent manner 
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which 5* splice site is selected in pre-mRNAs containing 
alternative sites (Ge and Manley 1990; Krainer et al . 
1990a). The preferential usage of the proximal 5' splice 
site at higher concentrations of SF2/ASF is counteracted by 
members of the hnRNP A/B family of proteins (Mayeda and 
Krainer 1992; Mayeda et al. 1994), suggesting that the 
relative abundance of these and possible other antagonistic 
splicing factors determines the patterns of alternative 
splicing in vitro and in vivo (Mayeda and Krainer 1992; 
Caceres et al. 1994; Yang et al. 1994; Wang and Manley 
1995; Hanamura et al. 1998). SF2/ASF and any other SR 
protein can complement splicing-def icient SlOO extracts 
that essentially lack SR proteins, but individual SR 
proteins sometimes display distinct specificities and 
efficiencies in splicing different pre-mRNAs (Krainer et 
al. 1990b; Ge et al. 1991; Fu and Maniatis 1992; Kim et al. 
1992; Mayeda et al. 1992; Fu 1993; Zahler et al. 1993; 
Screaton et al. 1995; Wang and Manley 1995) . Furthermore, 
an important activity of SF2/ASF and other SR proteins is 
the interaction with exonic enhancer sequences, which 
stimulate usage of an adjacent weak splice site (Sun et al. 
1993; Tian and Maniatis 1993; Staknis and Reed 1994; 
Ramchatesingh et al. 1995; Tacke and Manley 1995; Lou et 
al. 1996). The RNA-binding specificity of an SR protein is 
conferred by the RRM region and adjacent sequences (Caceres 
and Krainer 1993; Zuo and Manley 1993; Tacke and Manley 
1995; Allain and Howe 1997; Tacke et al. 1997); the various 
RS domains are responsible primarily for protein-protein 
interactions, which are modulated by the phosphorylation 
status of these regions (Wu and Maniatis 1993; Amrein et 
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al, 1994; Kohtz et al. 1994; Zuo and Maniatis 1996; Xiao 
and Manley 1997). In addition, it has been demonstrated 
that RS domains modulate the RNA-binding activity and 
subnuclear localization of SR proteins (Li and Bingham 
1991; Hedley et al. 1995; Caceres et al. 1997). Recently, 
SF2/ASF, SRp20, and 9G8 have been shown to shuttle between 
the nucleus and the cytoplasm, and this property depends on 
the presence and type of RS domain (Caceres et al . 1998). 

The studies described above led to the proposal that 
SR proteins might function by bridging splice sites through 
RNA-protein and protein-protein interactions, therefore 
establishing early interactions for splice-site definition 
and for the assembly of spliceosomes . These interactions 
are stimulated by binding of SR proteins to nearby enhancer 
sequences and modulated by 

phosphorylation/dephosphorylation of the RS regions. 

There is limited information about the regulation of 
SR protein expression in vivo. In the case of SF2/ASF and 
SRp20, significant differences in mRNA and protein levels 
have been observed in various cell types and tissues (Jumaa 
et al. 1997; Hanamura et al. 1998). SC35 expression is also 
highly variable in cell lines (Fu and Maniatis 1992; 
Vellard et al. 1992) and some SR proteins are activated by 
mitogens (Diamond et al. 1993; Screaton et al. 1995) . 
Interestingly, several alternatively spliced SR protein 
mRNAs have been described, which code for truncated 
proteins of still unknown function (Ge et al . 1991; Cavaloc 
et al. 1994; Screaton et al. 1995; Jumaa et al. 1997). 
Human SRp20 autoregulates its expression at the level of 
splicing by binding to its own pre-mRNA, thereby preventing 
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constitutive and alternative splicing when assayed in 
heterologous HeLa cell extracts. 

In an effort to isolate individual plant splicing 
factors, arginine/serine-rich proteins from Arabidopsis 
belonging to two different families have been characterized 
(Lopato et al. 1996b; 1999). Both families show good 
homology in the amino- terminal RRM with animal SR proteins, 
but at their carboxy-terminal domains they are richer in 
arginine than in serine, have few SR dipeptides, and they 
were therefore termed RS proteins. Nevertheless, these 
proteins are recognized by the mAbl04 antibody and can 
complement SR protein-deficient HeLa SlOO extracts 
efficiently. 

In Arabidopsis thaliana, a protein with similarity to 
human SF2/ASF protein could be identified (Lazar et al- 
1995) which is known under the name of SRI, atSRp34 or 
atSRp34/SRl. 

Influencing the splicing process in plants is one of 
the possible ways of arriving at improved plants. 
Therefore, it is an object of the present invention to 
provide new plant splicing factors which can be used within 
the scope of biotechnology. 

According to the invention, this object is achieved by 
a protein having splicing factor activity in plants, which 
either 

©comprises the amino acid sequence of the protein according 

to Fig. lA of the appendix, or 
©comprises the sequence of the amino acids 1 to 4, 7 to 19, 

45 to 52, 111 to 116, and 149 to 153 of the protein 
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according to Fig. lA of the appendix and has more than 
85% similarity with this protein, or 
©comprises more than 60% similarity with the splice 

proteins atSRp34/SRl and SF2/ASF according to Fig. 2 of 
the appendix, wherein the G-rich sequence, which 
corresponds to the amino acids 85 to 113 of the 
atSRp34/SRl protein is substituted by an S-rich sequence, 
or 

©corresponds to, or is derived from, the protein 
corresponding to Fig. lA from a plant other than 
Arabidopsis thaliana . 

The protein according to the invention advantageously 
is characterized in that it comprises the sequence of the 
amino acids 1 to 4, 7 to 19, 22 to 72, 74 to 85, 96 to 141, 
149 to 153, 156 to 172, with amino acid 168 being variable, 
yet not D or N, of the protein according to Fig. lA of the 
appendix. The highly conserved sequences indicated 
preferably are arranged such that the distances between the 
sequences indicated approximately correspond to those 
according to Fig. lA of the appendix, preferably not more 
than 5 amino acid deletions or amino acid insertions being 
provided. Particularly preferred are sequences in which 
either no, or merely one single, deletion or insertion is 
present between the conserved sequences. 

The protein according to the invention may comprise 
one or more amino acid substitutions as compared to the 
protein illustrated in Fig. lA of the appendix, as long as 
the sequence of the amino acids 1 to 4, 7 to 19, 4 5 to 52, 
111 to 116, and 149 to 153 of the protein according to Fig. 
lA of the appendix or the function of the protein as 
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splicing factor are not affected. Its function as a 
splicing factor is to be considered to be affected if the 
protein comprises, e.g., only 10 to 20% of the natural 
activity of the protein according to Fig. lA of the 
appendix. 

Amino acid substitutions may particularly be provided 
in the region of the amino acids of Fig. 2 of the appendix 
not marked as conserved, in particular if they do not 
essentially influence the three-dimensional structure or 
the charge distribution of the protein. In these regions, 
e.g., an interchange of hydrophobic amino acids or an 
interchange of aromatic amino acids is possible, unless the 
natural activity is lost thereby. 

According to the invention, all proteins are preferred 
which comprise an amino acid sequence having at least 90% 
identity with the sequence of the amino acids 1 to 85 and 
96 to 222 of the protein according to Fig. lA of the 
appendix. 

Proteins which comprise an amino acid sequence that 
has at least 95%, in particular at least 98%, identity with 
the sequence of the amino acids 1 to 85 and 96 to 222 of 
the protein according to Fig. lA of the appendix are 
particularly preferred. 

With the proteins of the invention it has been 
possible for the first time to find a second SF2/ASF- 
similar protein in an organism (adjacent atSRp34/SRl) , 
resulting in the possibility of a combined influencing of 
the splice process. In the appendix it is shown that the 
two proteins mutually influence the splicing process but 
also have an effect on other proteins which, moreover, with 
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the appropriate genetic control, will result in surprising 
possibilities of changing the splicing properties of the 
plant or of the plant cell- 

The S-rich sequence between amino acid 95 and 102 of 
the sequence according to Fig. lA of the appendix 
preferably comprises at least 4, in particular at least 1 , 
serine residues. Advantageously, it is G-free, in 
particular free from GGR or GGR repeats. 

The protein according to the invention can be provided 
by standard methods of molecular biology also from other 
plants, because of the high homology of such proteins, such 
as by PGR coupled cloning methods or by other screening of 
gene libraries, e.g. with the sequence according to Fig. lA 
of the appendix. 

According to a further aspect, the present invention 
relates to nucleic acid molecules, which either 
©comprise a nucleic acid sequence according to Fig. lA of 

the appendix, or 
©comprise a nucleic acid sequence encoding a protein of the 
invention, or 

©comprise a nucleic acid sequence which, under stringent 
conditions, binds to the nucleic acid molecule according 
to Fig. lA and encodes a splice protein active in plants 
or is complementary thereto. 

By stringent conditions, e.g., hybridizing in 7% SDS, 

0.5 M NaPO^, pH 7.0, 1 mM EDTA at 50''C followed by washing 

with 3% SDS is to be understood. 

These molecules may either be DNA or also RNA 
molecules. In the case of RNA molecules, the alternative 
spliced forms, in particular those described in the 
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appendix, are preferred embodiments of the present 
invention . 

Another object of the invention is a biologically 
functional vector which is characterized in that it 
comprises a nucleic acid molecule according to the 
invention . 

In a further aspect, the invention moreover relates to 
a system comprising a nucleic acid which encodes a protein 
according to any one of the claims, and a nucleic acid 
which encodes 

ilDthe atSRp34/SRl protein of Arabidopsis thaliana or 
©the protein corresponding to Fig. lA from a plant other 

than Arabidopsis thaliana, or 
®a protein derived from these proteins, 

wherein at least one of the nucleic acids is under the 
control of a promoter not naturally connected with these 
nucleic acids. 

By atSRp34/SRl-derived proteins also those proteins 
are to be understood which define the proteins according to 
the invention, starting from the protein illustrated in 
Fig, lA, in analogous manner. 

Preferably, both nucleic acids are controlled by 
promoters which are not naturally connected with these 
nucleic acids. 

Preferably, at least one of the promoters under whose 
control the nucleic acids are is an inducible promoter. 

According to a preferred embodiment of the present 
invention, the nucleic acid encoding a protein according to 
the invention is under the control of a promoter which 
causes an overexpression of this protein. 
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Preferably, the nucleic acid which encodes a protein 
according to the invention is under the control of a 
promoter which, under defined conditions, prevents the 
expression of this protein and, under defined different 
conditions, enables the expression of this protein. 

According to a further aspect, the present invention 
relates to transgenic plants or plant cells which express a 
protein according to the invention and comprise a nucleic 
acid molecule, a vector or a system according to the 
invention. 

Moreover, the present invention relates to the use of 
a protein according to the invention, a nucleic acid 
according to the invention, a vector according to the 
invention or a system according to the invention for 
changing the splice properties of a plant cell or of a 
plant . 

Moreover, the subjects of the invention may be used 
for changing the development behavior of a plant. 

A particular aspect lies in the use for retarding the 
flowering stage of plants. 

According to the invention, it has been shown that by 
such a use, a retardation of the flower formation of a 
plant of at least 15%, preferably at least 25%, as compared 
to its wild-type can be induced, e.g. if the protein 
according to the invention is put under the control of a 
strong promoter. 

Particularly preferred plants within the scope of the 
present invention are economic crop plants, such as 
cereal, beans, rice or fruit plants. 
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The invention will now be explained in more detail by 
way of the following examples as well as the drawing 
figures^ yet it shall not be restricted thereto. 

In the Figures, 

Fig. lA shows a genomic sequence of atSRp30; promoter 
and intron sequences are indicated in lower-case, cDNA 
sequences in upper-case, and the TATA box in bold italics. 
The derived protein sequence is shown below the DNA in the 
one-letter code. The bold sequence in the tenth intron is 
contained in alternative spliced mRNAs, and the underlined 
sequence is included if both cryptic 3' and 5' splice sites 
are used. The conserved RNP submotifs of both RRMs are 
boxed. The open arrows mark the ends of promoter sequences 
used for expression studies. Solid horizontal arrows 
indicate the ends of the PGR product obtained with 
degenerate primers and used as a probe for library 
screening, (i) The end of the GatSRpSO clone. The sequence 
data of the atSRpSO gene was submitted to the EMBL database 
(accession number AJ131214) . 

Fig. IB shows a schematic representation of GatSRp30, 
its mRNA isoforms, and derived proteins. Exons are shown as 
boxes and introns as lines (bold lines: introns included in 
the alternatively spliced mRNAs). Exonic 5' and 3' 
untranslated regions are shaded, and the coding regions are 
black. (*) The new stop codon in the alternatively spliced 
products . 

Fig. IC shows a schematic representation of GatSRp34, 
its mRNA isoforms, and derived proteins. The drawings are 
as in B (accession number AF001035) . 
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Fig. ID shows a partial genomic sequence of 
atSRp34/SRl starting from exon 10 up to the stop codon. The 
bold sequence in the long intron is included in 
alternatively spliced mRNAs, the underlined sequence is 
included when both cryptic 3* and 5* splice sites are used. 

Fig. 2 shows the alignment of Arabidopsis atSRpSO, 
atSRp34/SRl, and human SF2/ASF protein sequences (solid 
area) . Positions at which a single residue occurs in at 
least two of the sequences (shaded areas) . Conservative 
substitutions such as RK, IVL, ED, FY, ST. The positions of 
the conserved RNP-1 and RNP-2 submotifs and the glycine- 
rich region are indicated. 

Fig. 3 shows the expression of atSRp30 and atSRp34/SRl 
in wild-type Arabidopsis thaliana. (A) Expression in 

different plant tissues. Northern blot analysis of poly (A) 
RNA from leaves (L) , stems (S) , roots (R) , and flowers (F) 
is shown. One microgram of RNA was loaded per lane, and the 
blots were either probed with a probe corresponding to the 
tenth intron of GatSRp30, or with the cDNAs of atSRp30 or 
atSRp34/SRl. (B, C) Developmental expression of atSRp30. 
Total RNA was isolated from whole plants on different days 
during development starting from the day of germination. 
The RNA was either used for Northern blot analysis with 
10 pg/lane of total RNA and the membrane probed with 
atSRp30 cDNA (B) , or for analysis of the RT-PCR products on 
a 1.2% agarose gel (C) . The two primers for the PGR 
reaction were located in the exons adjacent to the tenth 
intron. 

Fig. 4 shows the immunodetection of phosphorylated and 
dephosphorylated SR proteins from cell cultures of 
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Arabidopsis . (A) Immunodetection of proteins in SR protein 
preparations (lanes 1-4). (Lane 1) Monoclonal antibody 
inAbl04, which is specific for a common phosphoepitope of SR 
proteins; (lane 2) polyclonal antibody raised against 
recombinant atSRp30; (lane 3) polyclonal antibody raised 
against recombinant atSRp34/SRl; (lane 4) monoclonal 
antibody specific for human SF2/ASF. Protein markers are 
indicated at the left. (B, C) Dephosphorylation of SR 
proteins with alkaline phosphatase. (Lanes 1, 2) SR 
proteins incubated at 37 °C without enzyme for 0 and 24 h, 
respectively; (lanes 3, 4) SR proteins incubated with 0.2 
and 1.0 U/ial of enzyme for 3 h; (lanes 5, 6) SR proteins 
incubated with 1.0 and 2.0 U/pl of enzyme for 24 h. 
Proteins were probed either with anti-p30 (B) or anti-P34 
(C) . 

Fig. 5 shows the transcriptional analysis of atSRp30 
(A-D) and atSRp34/SRl (E-H) promoter-GUS fusions in 
transgenic Arabidopsis. GUS staining of (A, E) flowers at 
the postanthesis stage; (B, F) staining patterns in the 
leaves; (C, G) primary and developing lateral roots; (D, H) 
seedlings (2 days after germination) . 

F±g. 6 shows the overexpression of atSRp30 in 
transgenic Arabidopsis plants. The cDNA and the genomic 
sequences of atSRp30 were cloned under the control of the 
strong 35S CaMV promoter (constructs pC30 and pG30, 
respectively) and used to transform Arabidopsis plants. (A) 
Northern blot analysis of total RNA isolated from 
independent transgenic lines probed with atSRp30 cDNA, 
(Lanes 1-4) Plants transformed with the pG30 construct; 
(lanes 5-9) plants transformed with pC30; (lane 10) control 
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plant transformed with pBI121 (35S CaMV-GUS construct) . 
mRNA isoforms are indicated. (B) RT-PCR analysis of the 
same transgenic lines. The primers used are indicated with 
arrows in the diagram of GatSRp30. (Lane 10) Four times 
5 more material was loaded in this control lane. (C) 

Immunodetection of atSRpSO in total protein extracts of 
overexpressing transgenic plants. (Lane 1) SR protein 
preparations from Arabidopsis cell culture (wild type) ; 
^ (lane 2) line transformed with pG30; (lanes 3-7) lines 

2 10 transformed with pC30; (lane 8) pC30 transgenic line that 

does not overexpress mRNAl . Antibody raised against atSRp30 
li (anti-p30) was used for immunodetection, 

ry Fig. 7 shows phenotypic changes in plants 

overexpressing atSRp30. (A) Plants overexpressing atSRp30; 
^ 15 (B) wild-type plants; (C) plants overexpressing atSRp30 
ry grown under short day conditions displays no apical 

dominance, resulting in a bushy phenotype. 

Fig- 8 shows the regulation of the alternative 
splicing pattern of atSRp31 and U1-70K in plants 
20 overexpressing atSRp30. The relevant alternative splicing 
events are shown in the diagrams at the top. RT-PCR was 
carried out using primers (marked with arrows) derived from 
the adjacent exons . (A) Changes in splicing in the second 
intron of GatRSp31. (Lane 1) Control line transformed with 
25 pBI121 (35SCaMV-GUS construct), (lanes 2-5) lines 

transformed with the pG30 construct; (lanes 6-10) lines 
transformed with pC30. (B) Correctly spliced mRNA encoding 
U1-70K protein is produced upon overexpression of atSRp30. 
(Lane 1) control plant transformed with pBI121 (35SCaMV-GUS 
30 construct), (lanes 2, 3) lines transformed with the pG30 
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construct; (lanes 4-8) plants transformed with pC30; (lane 
9) control pC30 transgenic plant that does not overexpress 
mRNAl; (lane 10) 19-day old wild-type plant. 

Fig. 9 shows changes in the expression pattern of 
atSRp34/SRl in plants overexpressing atSRpSO. (A) 
Preferential usage of an alternative 5' splice site in the 
eleventh intron of atSRp34/SRl. The gene^ the relevant 
alternative splicing events, and the derived proteins are 
depicted at the top, the gel showing the RT-PCR products 
obtained with primers from the adjacent exons. (Lane 1) 
Control, untransf ormed wild-type plant; (lanes 2 to 5) 
lines transformed with the pG30 construct; (lanes 6 to 11) 
plants transformed with pC30; (lane 11) control pC30 
transgenic plant that does not overexpress mRNAl. (B) 
Immunodetection of atSRp34 and atSRp34s in total protein 
extracts from transgenic plants. (Lanes 1, 2) SR protein 
preparations isolated from Arabidopsis plants and cell 
culture, respectively; (lane 3) control plant transformed 
with pC30 that does not overexpress mRNAl; (lane 4) line 
transformed with pC30; (lane 5) line transformed with the 
pG30 construct. Anti-p34 antibodies were used for 
immunodectection . 
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Examples : 

M2^TERIAIiS AND METHODS 

Purification of SR-proteins and protein sequencing 
The SR protein purification has been described 
5 previously (Lopato et al. 1996a). The proteins were 

separated by SDS-PAGE. After Coomassie G staining, the 
protein bands were excised and subjected to in-gel 
digestion with Lys-C endopeptidase . The resulting peptides 
were separated by HPLC and sequenced by automated Edman 
10 degradation as described previously (Wang et al. 1996) . 

Isolation and sequencing of genomic DNA and cDNA of 
atSRp30 and atSRp34/SRl 
ry Total DNA of carrot, tobacco, and Arabidopsis was 

isolated by the cetyltrimethylammonium bromide (CTAB) 
15 method (Murray and Thompson 1980) . The most conserved part 
of the sequenced peptide YVGNLPGDI was used to design two 
degenerate forward primers. Two reverse degenerate primers 
were derived from the peptide sequence SWQDLKDHM, wich was 
also part of a sequenced peptide- All four combinations of 
20 degenerate primers were used in PGR reactions with genomic 
DNA from different plants as a template. PGR products were 
subcloned and sequenced. The PGR product from Arabidopsis 
was used as a probe for screening a genomic X-ZAPII library 
of a. thaliana var. Columbia (Stratagene) . One positive 
25 clone, GatSRpSO, was found, mapped, subcloned, and 

sequenced (EMBL accession number AJ131214) . A partial cDNA 
sequence was found by a BLAST search through the EST 
databases with the accession number R65514, 4018 
Arabidopsis thaliana cDNA clone 17J1T7, and was kindly 
30 provided by the Arabidopsis Biological Resource Genter at 
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Ohio State University. Sequence homology was analyzed using 
Genetics Computer Group (GCG, University of Madison, 
Madison, WI) sequence analysis software package version 
7.1-UNIX. Sequencing of the cDNA and the genomic clones was 
done using an A.L.F. DNA Sequencer and Auto Read Sequencing 
kit (Pharmacia) . To obtain a cDNA clone of atSRp34/SRl, the 
EST clone with the accession number ACT76795, 11573 atcDNA 
clone 151I9T7, was sequenced. The genomic sequences for 
atSRp34/SRl (accession number AF001035) were isolated by 
PGR and sequenced. 

Analysis of alternatively spliced isoforms 
RNA blot analyses were done as described (Lopato et 
al. 1996b). cDNAs encoding alternatively spliced isoforms 
were obtained by RT-PCR using total RNA preparations from 
A. thaliana plants at different stages of development and 
primers derived from the 3' untranslated region (for 
reverse transcription) . 

( 1 ) 5 ' -AAAT GAGCTC AAATGTATATGTATGGAAAAACC- 3 * 
(atSRp30) and 

( 2 ) 5 ' -AATGAGCTCGAAACGATATCTTCAAAAAAAAAC-3 ' 
(atSRp34/SRl) 

(The underlined nucleotides correspond to restriction 
sites) and from the beginning of the 5' untranslated region 
at the end of coding region for (PGR) : 

( 3 ) 5 ' -AAAGTGGATCCAGAAGAATGTAAGGGTTTGTGG- 3 ' 
(atSRp30) and 

( 4 ) 5 ' -ATATAGGATCCTGAAGGAGAUAUGAGAGGTG- 3 ' 
(atSRp30); and 

( 5 ) 5 • -AAATATCTAGAGATGTGAAATGGAGGAGG-3 ' 
(atSRp34/SRl) and 
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(6) 5 ' -ATATA GGATCC CATTTTACCTCGATGGAC-3 ' 
(atSRp34/SRl) . 

All products were sequenced. Alternative splicing of 
the long introns was studied using primers derived from 
ad j acent exons : 

( 7 ) 5 ' -AAT GAGCTCTG TGTCACCTGCTAGATCC- 3 ' 
(atSRp30) and 

( 8 ) 5 ' -ATAT AGGATCCA GATATCACAGGTGAAAC - 3 ' 
(atSRp30); and 

( 9 ) 5 ' -AT AGGATCCA GGAGCAGAAGTCCCAAGGCAAAG-3 ' 
(atSRp34/SRl) and 

(10) 5 • -AAA GTCGACA GAAGGTAGAGGAGATCTTGATC-3 ' 

(atSRp34/SRl) . 
The fragments were analysed on a 1.2% agarose gel. 
Constructs for promoter analysis and overexpression 
Approximately 1 kb of promoter sequences of atSRp30 
and atSRp34/SRl plus their complete 5* untranslated region 
(including the first intron in the case of atSRp34/SRl) 
were obtained by PGR from genomic clone GatSRp30 and 
genomic DNA of A. thaliana^. respectively, as a template, 
using primers 

(11) 5 ' -AAACTAAGCTTGGTATCTTCTTCCCTGCAAG- 3 ' 
(atSRp30) ; 

( 12 ) 5 ' -AAACCTAGGCGGCTACTCAGCTGATACCTCAGAGCAG-3 ' 
(atSRp30) ; 

(13) 5 ' -AAACTAAGCTTAAATATTGAACCGGCCTCGGTTC-3 ' 
(atSRp34/SRl) ; 

(14) 5 ' -AAACTGGATCCTCTTCCTGTTGGTCGTCGACGATTTG- 3 ' 
(atSRp34/SRl) ; and 

(15) 5 ' -AAACTGGATCCTCTTCCTTTATCAAATCC-3 ' 
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(atSRp34/SRl) 

containing Hindlll and BamHI restriction sites. These 
fragments were digested with Hindlll and BamHI and fused to 
the GUS reporger gene (Jefferson 1987) in pBIlOl 
(Clontech) . 

The cDNA and the genomic sequences of atSRp30 were 
amplified from cDNA and genomic DNA from A. thaliana using 
primers 3 and 1, containing BamHI and Sad restriction 
sites, respectively. The DNA fragments were sequenced and 
inserted into the BamHI-SacI restriction sites of the 
pBI121 (Clontech) vector (with the GUS gene deleted) under 
the control of the 35S CaMV promoter, giving rise to 
constructs named pC30 and pG30, respectively. 

Cultivation of plants and suspension cultures and 
plant transformation procedure 

Conditions for maintaining carrot and tobacco BY2 
suspension cultures were as described earlier (Lopato et 
al. 1996a). The suspension culture of A. thaliana ecotype 
Columbia was a gift from Czaba Koncz (Max-Planck-Institut 
fur Zuchtungsf orschung, Koln, Germany) and was maintained 
as described in Lopato et al. (1999). A. thaliana ecotype 
Columbia was used in all transfer experiments. Seeds were 
surface-sterilized by a 30 min imbibition with water, 
followed by a 1 min incubation in 70% ethanol, 10 min in 
10% bleach and five rinses with sterile water. Sterilized 
seeds were grown in MS medium (Murashige and Skoog 1962), 
supplemented with 1% sucrose. Plants were usually 
maintained in 16 h light/8 h dark cycle at 23°C, unless 
otherwise indicated. The cell culture was grown in medium 
containing Murashige and Skoog salts, 2x Gamborg's vitamins 
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(Gamborg et al. 1968), 1 mg/liter 2 , 4-dichlorophenoxy- 
acetic acid (2,4-D), and 3% (wt/vol) sucrose. Cell cultures 
were incubated in 50 ml of medium in 250 ml conical flasks 
on a rotary shaker at 110 rpm at 23 °C in dim light. Cell 
suspensions were subcultured every 7 days and were diluted 
threefold with fresh medium at each subculture. 

All constructs were introduced into Agrobacterium 
tumefaciens LBA4404 (Hoekema et al. 1983), using a 
triparental mating procedure with the helper plasmid 
pRK2013 (Ditta et al. 1980) . Root explants of A. thaliana 
were transformed using the method described by Valvekens et 
al. (1988) with one modification: Root explants were taken 
from 15-day-old seedlings grown on vertically placed agar 
plates (Huang and Ma 1992) . Seeds of transgenic plants were 
germinated in the presence of 50 pg/ml of kanamycin. 

Histochemical assay of GUS expression 

Histochemical assays of GUS expression were carried 
out with 5-bromo-4-"Chloro-3-indolyl glucuronide (X-Gluc, 
Duchefa) as a substrate and were performed on intact 
seedlings or excised organs of mature plants grown in vivo 
as described by Jefferson (1987) . The samples were treated 
with 70% ethanol for 2-6 h to remove chlorophyll from 
tissues where necessary. 

Purification of SR proteins from Arabidopsis and 
immunoblotting 

SR proteins were purified from 3-week-old Arabidopsis 
plants or 5-day-old suspension cultures using a two-step 
salt precipitation method as described (Lopato et al. 
1996a) . Total protein extracts were prepared with the 
buffer for SR protein isolation (Zahler et al. 1992) . 
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Proteins from the magnesium precipitate and total protein 
extracts were separated on a 12.5% SDS gel. Protocols for 
immunoblotting and detection have been described earlier 
(Lopato et al. 1999) . Protein dephosphorylation was carried 
out with alkaline phosphatase from calf intestine 
(Biolabs) . 

Expression of atSRp30 and atSRp34/SRl in bacteria 

The coding region of atSRp30 cDNA was amplified by PGR 

using the primers 5 ' -ATATA CCATGG GTAGCCGATGGAATCGTAC-3 ' and 

(4) . 

The coding region of atSRp34/SRl cDNA was amplified by 
PGR using the primers 5 ' -ATAT ACCATGG GGAGTCGT-TGGAG-3 ' and 
(6) . The primers contain Ncol and BamHI restriction sites. 
To obtain the Ncol restriction site, the fourth nucleotide 
of the coding region in both cases was changed to G. Thus 
expressed atSRpSO and atSRp34/SRl have Ser2-Arg and Ser2- 
Gly substitutions, respectively. The fragments were cloned 
into the bacterial expression vector pET-3d (Novagen) and 
transformed into the E. coli strain BL21 (DE3) pLysS 
(Novagen) . 

Single colonies were grown in 100 ml of LB medium to a 

density of 0.4-0.5 0D^^°, transferred to 900 ml of fresh 
pre-warmed medium and incubated for 1 h. The cultures were 
induced with 0.4 mM IPTG and were grown for a further 5 h. 
The bacterial pellet was washed twice in 500 ml of ice-cold 
washing buffer (100 mM NaCl, 10 mM Tris-HCl at pH 7.5) and 
used for inclusion body isolation: 2 grams of bacterial 
pellet was resuspended in 18 ml of ice-cold buffer A (10 mM 
Tris-HCl at pH 7.9, 100 mM KCl, 2 mM DTT, 35% sucrose). 4.5 
ml of ice-cold buffer B (333 mM Tris-HCl at pH 8.0, 100 mM 
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EDTA at pH 8.0, 40 mg of lysozyme) was added and incubated 
on ice for at least 10 min. Lysis buffer (22,5 ml, 1 M 
LiCl, 20 mM EDTA, 0.5% NP-40) was added, and the solution 
was sonicated for a total of 2 min at full power (15 sec 
bursts with cooling periods of 1 min in between) . Following 
centrifugation for 10 min (Sorvall, SS34, 13000 rpm) , the 
pellet was washed twice in 25 ml of buffer C (10 mM Tris- 
HCl at pH 8.0, 0.1 mM EDTA, 0.5 M LiCl, 0.05% NP-40, 1 mM 
DTT) , and then twice with the same buffer lacking LiCl- The 
isolated inclusion bodies contained >95% pure protein and 
were used for polyclonal antibody preparation (Lopato et 
al. 1999) . 

RESULTS 

Isolation and sequencing of genomic and cDNA clones of 
atSRpBO 

Initially, plant SR proteins in magnesium chloride 
pellets from protein extracts of carrot and tobacco cell 
cultures and from Arabidopsis plants were identified. The 
proteins were detected with the monoclonal antibody mAbl04, 
specific for a shared phosphoepitope in all SR proteins, 
and with a monoclonal antibody specific for the human 
SF2/ASF splicing factor (Lopato et al. 1996a). A protein of 
approximately 50 kD from the carrot SR preparation, which 
was recognized by both antibodies, was isolated and 
partially sequenced. Two peptides with significant homology 
to human SF2/ASF and atSRp34/SRl were found. The first 
peptide comprised the RNP-2 submotif of the first RRM, and 
the second peptide had high homology to a sequence situated 
between RNP-2 and RNP-1 of the second RRM (see Materials 
and methods; Fig. 2, above) . Based on these sequences. 
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degenerate primers were synthesized and used for PGR on 
purified genomic DNA from carrot, tobacco, and Arabidopsis- 
Sequencing of the cloned PGR products revealed two very 
homologous sequences from Arabidopsis and tobacco. The 
5 Arabidopsis fragment was 838 bp long and contained five 
introns. The fragment borders of the Arabidopsis fragment 
are marked with black arrows in Fig. lA. The protein 
sequence derived from the PGR sequence had extensive 

U homology to both human SF2/ASF and atSRp34/SRl (Fig. 2). 

^ 10 The PGR product from Arabidopsis was used as a probe to 

^ screen a X ZAPII genomic library of A. thaliana. One 

genomic clone was found and designated GatSRpSO (genomic 
clone of Arabidopsis thaliana serine/arginine-rich protein 

f with derived molecular mass of approximately 30 kD) . It was 

L-a., 

O 15 >4.5 kb long, had 1805 bp of promoter region, but ended 
Iz 12 bp upstream of the stop codon. A corresponding cDNA was 

O obtained from an expressed sequence tag (EST) cDNA library 

'"^ of A. thaliana ecotype Golumbia and sequenced. The mRNA 

corresponding to this cDNA was designated mRNAl of atSRp30. 
20 Primers derived from the 3' untranslated region of this 
cDNA were used to obtain the missing sequences of the 
genomic clone by PGR amplification on purified genomic DNA. 
The DNA fragment contained an additional intron, and the 
rest was identical to the corresponding sequence of the 
25 cDNAl. The sequence of GatSRp30 and the derived protein 
sequence are shown in Fig. lA. 

To facilitate comparison of features of atSRp30 and of 
the closely related Arabidopsis atSRp34/SRl protein (Lazar 
et al. 1995), the genomic sequence of this protein was 
30 determined using PGR. GatSRp34 has a very similar gene 
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structure to that of GatSRpSO, except for an intron present 
in the 5' untranslated region of the former. The fragment 
of GatSRp34 comprising the part from exon 11 to the stop 
codon is shown in Fig. IC, where the alternative splicing 
events involving intron 11 are also indicated. 

Comparison of atSRp30 to other SR proteins 
The derived protein sequences of atSRp30 and 
atSRp34/SRl are very homologous (80.7% similarity and 67.1% 
identity) to each other, and both show very high similarity 
(75.3 and 77.8%, respectively) and identity (58.1 and 
59.4%, respectively) to human SF2/ASF (Fig. 2). As atSRp30 
and atSRp34/SRl are less homologous to other animal or 
plant SR proteins identified to date, both proteins can be 
considered true orthologs of human SF2/ASF. Within their 
amino-terminal portions, all three proteins contain two 
RRMs with their conserved RNP-2 and RNP-1 submotifs, 
whereby the second RRM is atypical and contains the 
invariant SWQDLKD signature characteristic of SF2/ASF-like 
SR proteins (Birney et al. 1993). However, unlike 
atSRp34/SRl and SF2/ASF, the RRMs of atSRp30 are not 
separated by a glycine-rich 'hinge* region but rather by a 
serine-rich sequence. The RS domain of atSRp30 is shorter 
than the one of atSRp34 because of an extension at the 3' 
end of atSRp34 which includes a previously described 
positively charged proline/serine/lysine-rich (PSK) domain 
of unknown function (Lazar et al. 1995). If this unique 3* 
extension is not taken into account, atSRp34/SRl is 
slightly more homologous to human SF2/ASF, mainly because 
of their common G-rich hinge region. Taken together, these 
analyses suggest that in contrast to mammals, for which 
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only one SF2/ASF protein has been described to date, two 
SF2/ASF homologs exist in Arabidopsis. 

RNA distribution and alternative splicing forms of 
atSRp3Q and atSRp34/SRl 

RNA blots of poly (A) mRNA from various tissues of 
wild-type Arabidopsis plants were probed with radioactive 
labeled atSRp30 or atSRp34/SRl cDNAs and revealed at least 
two mRNA species in each case (Fig. 3A) . The level of 
expression of each gene varied considerably in different 
tissues but in both cases was highest in flowers, followed 
by roots (Fig. 3A) . In addition, the ratio of the two 
discernible mRNAs, mRNA3 and mRNAl, was different for each 
gene in the various organs. For atSRp30, mRNA3 seemed to be 
more abundant in leaves, stems and flowers, whereas mRNAl 
was predominant in roots. In contrast, the ratio of the two 
main mRNAs of atSRp34/SRl was approximately 1:1 in leaves 
and stems, whereas in roots and flowers mRNAl predominated. 
Reprobing of the RNA blot with the long tenth intron of 
atSRp30 showed that mRNA3 retained sequences of this 
intron, indicating that alternative splicing involves this 
region of atSRp30 pre-mRNA. 

The full sequences of the various mRNA isoforms were 
obtained by RT-PCR cloning from total RNA using primers 
from the 5' and 3' ends of the genes (see Materials and 
Methods) . mRNAl corresponded to the cDNA sequence of 

atSRp30, whereas mRNA3 retained part of the 10 intron, 
because of usage of an alternative 3' splice site (Fig. lA, 
B) . The mRNA2 isoform with both alternative 3' and 5* 
splice sites within intron 10 (Fig. lA, B) was not visible 
in Northerns of wild-type plants (see Fig. 3A) and was only 
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detected by RT-PCR by increasing the number of cycles (data 
not shown) . 

The same approach was used to sequence the mRNA 
isoforms of atSRp34/SRl (Fig. IC) . The longest mRNA 
species, which is also the most abundant in wild-type 
plants, was named mRNA5 and retained most of intron 11 
(which is in the position corresponding to intron 10 of 
atSRpSO) except for an internal intron of approximately 80 
nucleotides. Two other minor alternative mRNAs could be 
detected during early plant development, but only by RT-PCR 
(data not shown) . mRNA4 uses an additional alternative 3' 
splice site; mRNA2 uses alternative 5' and 3' splice sites 
(Fig. IC, D) ; and mRNA3, which is the main alternatively 
spliced mRNA in plants when atSRp30 is overexpressed from 
the SSSCaMV promoter (Fig. 5, below) uses one alternative 
5' splice site. 

The above alternatively spliced mRNAs of atSRp30 and 
atSRp34/SRl all had an in-frame stop codon near the 5' end 
of the retained intronic sequences. The hypothetical 
shorter proteins, designated atSRp30s and atSRp34s, lack 
the carboxy-terminal part of the RS domain and instead have 
other sequences that are shown in bold in Fig. lA for 
atSRp30s and in Fig. IC for atSRp34s. 

To obtain information about changes in the 
transcriptional and alternative splicing patterns of 
atSRp30 during plant development, total RNA was isolated 
from whole plants of different ages and used for RNA blot 
hybridization and RT-PCR analysis (Fig. 3B, C) . Primers 

derived from exons adjacent to the 10^^ intron were used 
for RT-PCR. The results from both methods were in good 
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agreement and showed that the expression of rtiRNAl is 
highest in younger plants and starts to decline around day 
12, whereas the expression of mRNA3 is extremely low in 
young seedlings, peaks between days 9-14, and declines 
slowly afterwards. Although it is not known how the ratio 
of these two transcripts is regulated, these regulatory 
events might determine the quantity of genuine atSRp30 
protein in individual cells. 

Antibdoy detection of Arabidopsis SR proteins 
Polyclonal antibodies were raised in chickens against 
purified recombinant atSRp30 and atSRp34/SRl and used to 
identify antigens in different ammonium sulfate fractions 
of Arabidopsis extracts. The extracts were fractionated by 
sequential precipitation with ammonium sulfate and 
magnesium chloride, to enrich for SR proteins (Roth et al . 
1990) . The anti-p30 and anti-p34 antibodies recognized 
proteins in the 60%-90% ammonium sulfate part. The 
immunoreactive proteins in the dialyzed 60%-90% part 
precipitated quantitatively in the presence of 20 mM 
magnesium chloride. No crossreaction of plant proteins with 
the pre-immune immunoglobulin fraction was found (data not 
shown) . 

The magnesium precipitate was immunoblotted and probed 
with four different antibodies. With anti-p30, six protein 
bands were detected as three doublets, with the 43- to 4 6- 
kD doublet showing the highest intensity. The two other 
doublets migrated with apparent molecular masses of 38-40 
and 31-34 kD (Fig. 4A, lane 2) . The immunostaining pattern 
with the monoclonal antibody mAbl04 (Fig. 4A, lane 1), 
which is specific for a serine phosphoepitope common to SR 
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proteins (Roth et al . 1990) was very similar, although the 
smallest band was more pronounced than with anti-p30 (lane 
2) . This could be indicative of an additional SR protein of 
approximately 30 kD, or it may reflect the presence of 
multiple or stronger phosphoepitopes within p30. No high 
molecular mass proteins were detected with the mAbl04 
antibody, although Lazar et al. (1995) have reported 
immunoreactive proteins up to 120 kD. 

The anti-p34 antibodies recognize mainly three 
proteins of approximately 46-47, 40 and 34 kD. As the same 
antibody shows minimal cross-reaction with recombinant 
atSRp30 (data not shown) , these bands possibly represent 
atSRp34/SRl-related proteins and correlate well with 
published overexpression data in which the protein was 
immunodetected with a size of 47-48 kD (Lazar et al. 1995) . 
The 40-kD protein comigrates with an alternatively spliced 
isoform of atSRp34/SRl (see below), whereas the nature of 
the 34-kD polypeptide remains to be determined. 
Interestingly, the immunoblot with anti-p34 is very similar 
to that obtained with a monoclonal antibody specific for 
human SF2/ASF (Fig. 4A, cf . lanes 3 and 4) . Therefore, 
anti-hSF2, which recognizes a discontinuous epitope within 
RRMl of hSF2/ASF but does not recognize other human SR 
proteins (Hanamura et al. 1998) cross-reacts with the same 
set of Arabidopsis proteins as anti-p34, supporting the 
notion that hSF2/ASF resembles atSRp34/SRl in this region 
more closely than it does atSRp30. 

The fact that atSRp30 and atSRp34/SRl are present in 
the SR protein preparation and comigrate with specific 
bands in the mAbl04 immunoblot suggests that both proteins 
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are phosphorylated . To test this suggestion, the SR protein 
preparation was treated with increasing concentrations of 
alkaline phosphatase and for different times (Fig, 4B, C) . 
Compared to the untreated controls (lanes 1, 2), these 
treatments resulted in an increase in the mobility of all 
immunostained bands. Under these conditions, anti-p30 
recognized predominantly three proteins {21, 31, and 
38 kD) , whereas anti-p34 recognized two proteins (31 and 
38 kD) . As expected, all these proteins were no longer 
stained with mAbl04, except for the 38-kD band, which 
probably represents a partially dephosphorylated 
intermediate resistant to further dephosphorylation (data 
not shown) . Interestingly, dephosphorylation led to the 
appearance of new bands of higher apparent molecular mass, 
consistent with the observation that unphosphorylated 
recombinant atSRpBO and atSRp34/SRl are very insoluble and 
have a strong tendency to aggregate . 

Transcriptional activity of SR protein promoters in 
Arabidopsis 

It is striking that in contrast to mammals, 
Arabidopsis possesses two SF2/ASF-like proteins. As 
revealed by the Northern blot analysis, the genes of both 
proteins are transcribed in all plant organs. It was 
therefore of interest to investigate if both genes are 
generally transcribed in all tissues or specifically in 
different tissues, or to which extent they have overlapping 
activities. To this end, the promoters of GatSRp30 and 
GatSRp34, including the 5' untranslated regions up to the 
start codon, were fused to the coding region of a reporter 
(3-glucuronidase gene and transferred into A. thaliana 
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plants by Agrobacterium tumef aciens-mediated root 
transformation. p-Glucuronidase (GUS) activity was never 
observed in any tested nontransgenic Arabidopsis tissues^ 
as well as in transgenic plants harboring a promoterless 
GUS gene. Control plants containing a 35 S cauliflower 
mosaic virus (CaMV) promoter-Gus fusion were easily 
stained, demonstrating that substrate accessibility was not 
limiting GUS activity. To exclude an influence of the 
transgene position on promoter activity, the analyses were 
carried out with several independent transgenic lines. 

Using detailed histochemical analyses, it was found 
that both genes had distinct but also overlapping patterns 
of transcriptional activities. Flowers in the postanthesis 
stage (Fig. 5A, E) showed essentially the same staining 
patterns for both genes, with the most pronounced 
expression in the pollen grains and weak staining of sepals 
and style. Additionally, weak activity was observed in the 
abscission zone of the flower in atSRp30-GUS transgenic 
plants (Fig. 5A) . The leaves of atSRp30-GUS plants showed 
strongest staining in vascular bundles and in the support 
cells of each trichome (Fig. 5B) , whereas the atSRp34-GUS 
fusion was weakly active only in secondary veins and in the 
cells around them, but activity was never observed in 
trichomes or support cells (Fig. 5F) . The differences in 
the GUS staining patterns of these genes were especially 
evident during the different stages of lateral root 
development. Both atSRp30 and atSRp34-GUS fusions were 
active at the very first stages of lateral root initiation, 
when pericycle cells are stimulated, de-differentiate, and 
proliferate. Then, at the later stages, when 
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redif f erentiation occurs to form the lateral root meristem, 
atSRp30-GUS was expressed in the basal enlarged cells of 
the forming lateral root (Fig. 5C) , whereas atSRp34-GUS 
expression was characteristic of the activated root 
meristem (Fig. 5G) . The staining patterns during early 
seedling development provided additional evidence for the 
distinct transcriptional activities of atSRp30 and 
atSRp34/SRl. Each of the genes was expressed in specific 
regions along the apical-basal axis of the seedling. In 
atSRp30-GUS seedlings, expression was restricted to the 
cotyledons (Fig. 5D) , whereas in atSRp34-GUS seedlings, 
expression was observed in hypocotyl and in root (Fig. 5H) , 
suggesting that these genes can be transcriptionally active 
in regions with different patterns of cell division. This 
suggestion was confirmed by differential expression of 
atSRp30 and atSRp34/SRl during lateral root formation, as 
well as in trichome support cells. 

Overexpression of atSRp30 in transgenic plants 
In Drosophila, ectopic overexpression of SRp55/B52 
resulted in various developmental abnormalities, although 
the identity of affected transcripts remains unknown (Kraus 
and Lis 1994) . It was therefore of interest to study the 
effect of overexpression of atSRp30 on plant development 
and on the alternative splicing patterns of individual 
plant transcripts. The constructs used contained either a 
complete gene (pG30) or a cDNA (pC30) encoding atSRpSO 
under the control of the strong constitutive promoter of 
the 35S RNA from CaMV. The 35S CaMV promoter is strong and 
constitutive in all plant tissues studied, which was 
confirmed in control experiments using GUS as a reporter 
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gene- To control for the conditions of the regeneration and 
transformation procedure in the RNA analysis of 
transf ormants, negative controls were either transformed 
with a 35SRNA promoter-GUS control (pBI121) or were 
transformants with the same construct, but which for 
unknown reasons did not show overexpression . 

Both fusion constructs, pG30 and pC30, were used for 
Agrobacterium-mediated transformation of Arabidopsis roots - 
Forty independent transgenic lines were regenerated for 
each construct; eight of the lines transformed with pG30 
and twelve transformed with pC30 were used in further work- 
Some of these transgenic lines were used for RNA blot and 
RT-PCR analyses as shown in Fig. 6A and B. Surprisingly, 
all pG30 transformants (Fig. 6A, B, lanes 1-4) expressed 
mainly mRNA3, which possesses an alternative 3' splice site 
within the long intron (Fig. IB), but nevertheless, mRNAl 
was still more abundant in these plants than in the control 
plant (Fig. 6A, lane 10), as judged by the RNA blot 
analysis (four times more product was loaded in lane 10 in 
Fig. 6B) . Although mRNAl levels in both types of 
transformants are quite different (Fig. 6A) , the RT-PCR 
shows similar yields (Fig. 6B) because a high number of 
cycles was employed to detect all alternatively spliced 
products. As expected, all transformants containing pC30 
overexpressed only mRNAl (Fig. 6A, lanes 5-9) . In both 
types of transformants, overexpressed mRNAs were >100-fold 
more abundant than in wild-type plants. Interestingly, when 
pC30 transformants were analyzed by RT-PCR, only mRNA2, 
which uses cryptic 3' and 5' splice sites in intron 10 
(Figs. 6B and IB), was present, whereas a small amount of 
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mRNA3 was visible in wild-type plants. This result shows 
that an excess of atSRpSO changes 5 ' -splice-site selection 
of the endogenous atSRp30 pre-mRNA. 

In light of the above results, it was of interest to 
5 measure overexpression of atSRp30 at the protein level. 
Total soluble protein extracts of transgenic plants were 
analyzed by Western blotting using anti-p30 for 
immunodetection (Fig. 6C) and compared to a control plant 
(lane 8, a transformed plant with no overexpression of 
10 atSRp30) and to an SR protein preparation from an 

Arabidopsis suspension culture (lane 1) . The SR protein 
preparation had the characteristic pattern for anti-p30 
(cf. Fig. 6C, lane 1 and Fig. 4A, lane 2), whereas no 
""^^ specific proteins were immunostained in the total protein 

15 extract of the control plant (Fig. 6C, lane 8), because of 
y the low abundance of atSRp30 in plants- In contrast, a 

yJ specific protein band was visible in the pC30 transf ormants 

E (lanes 3-7), comigrating with a protein band in the SR 

protein preparation (lane 1) . In contrast, in the pG30 
20 transformant (lane 2), only a faint protein band was 
detected. These results correlate well with the RNA 
expression pattern of mRNAl in both types of transf ormants 
(Fig. 6A) . It is interesting that overexpression of atSRp30 
yielded an immunostained protein of 38 kD and not a band 
25 comigrating with the more abundant 4 3- to 4 6-kD doublet 

seen in the SR protein preparation (Fig. 6C, lane 1) . One 
likely explanation is that overexpressed atSRp30 is only 
partially phosphorylated, which may be related to the 
observation that even exhaustive dephosphorylation results 
30 in a 38-kD atSRp30 protein that retains the mAbl04 
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phosphoepitope (Fig. 4B, lanes 3-6) . However, no shorter 
protein product (atSRp30s) of the highly abundant mRNAS in 
pG30 transf ormants could be detected with any antibody used 
(see also Fig. 6C, lane 2) . 

Phenotypic changes in plants overexpressing atSRp3Q 
Overexpression of atSRp30 resulted in strong 
phenotypes with pleiotropic changes both in morphology and 
development of the transgenic plants. No significant 
differences were observed between plants transformed with 
pG30 and pC30 constructs^ although the levels of atSRp30 
protein were different (Fig. 6C, cf . lane 2 and lanes 3-7) . 
The observations were reproducible in T2 and subsequent 

generations of independent transgenic lines, and cosegrated 
with antibiotic resistance. In transgenic plants, the 
transition from vegetative to flowering stage was delayed 
greatly under short day conditions. The time from 
germination to the formation of the first mature silique 
was 65-78 days in overexpressing plants, compared to 42-47 
days in control plants grown under the same conditions. In 
addition, adult plants showed reduced apical dominance, 
resulting in a 'bushy' phenotype with an increased number 
of secondary inflorescences and a shortened primary 
inflorescence (Fig. 7C) . Growing of the transgenic plants 
under long-day conditions resulted in partial reversion to 
a normal phenotype. But despite of the partial recovery of 
apical dominance, the transition to flowering and the time 
to senescence remained delayed. Under long-day conditions, 
the majority of plants from overexpressing lines started to 
flower between 35 and 39 days after germination, whereas 
control plants flowered between days 25 and 28. The initial 
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inflorescences were very short and did not exceed 0.5-1 cm 
when the first flower opened- The size of flowers in 
transgenic plants was approximately 30% larger than in 
control plants (4 and 3 mm, respectively) . Transgenic 
plants also had larger rosette leaves (Fig. 7A, cf. to wild 
type in B) with trichomes having predominantly four to five 
branches, compared to wild-type leaves, which had mainly 
three-branched trichomes. In strong overexpressing lines, 
the primary inflorescence produced numerous secondary 
branches with a vegetative rosette-like appearance. These 
branches developed from 7-15 vegetative-like leaves and 
finally formed inflorescences. Under long-day conditions, 
the rosette leaf numbers determined at the time of 
flowering were somewhat lower in transgenic lines compared 
to the wild type (an average of 12 and 16 leaves, 
respectively) . Independent transgenic lines overexpressing 
atSRp30 displayed the described characteristics to various 
extents, with the main impact on the time for transition 
from the vegetative to the reproductive phase. 

Qverexpressed atSRp30 modulates alternative splicing 
in vivo 

The influence of overexpression of atSRp30 on its own 
alternative splicing pattern and the high homology of 
atSRp30 to human alternative splicing factor SF2/ASF 
suggested that this protein could be a modulator of plant 
splicing. This assumption cannot be tested in vitro, as no 
plant splicing extracts are available. Therefore, total RNA 
preparations from pG30 transf ormants (lower level of 
atSRp30 overexpression) and pC30 transf ormants (higher 
level of atSRp30 overexpression) were used for RT-PCR 
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analysis of several plant introns that are known to be 
processed alternatively under wild-type conditions. 

Among several genes tested, such as FCA (Macknight et 
al. 1997), rubisco activase (Werneke et al- 1989), agamous 
(Yanofsky et al. 1990), and LSDl (Dietrich et al. 1997), no 
changes in the splicing patterns were observed in plants 
overexpressing atSRp30. However, pre-mRNA splicing of three 
plant genes was found to be affected by overexpression, as 
described below. 

The splicing factor atRSp31 belongs to a novel family 
of plant RS proteins and possesses an alternatively spliced 
second intron (792 nucleotides) (Lopato et al. 1996b). In 
the wild-type plant, either a cryptic 3* splice site or 
both alternative 3' and 5' splice sites were used in this 
intron (Fig. 8A) , whereas the latter form prevailed in 
control lines (lane 1) and pG30 transformed lines (lanes 2- 
5) . In contrast, pC30 lines (lanes 6-10) expressed mainly 
the alternative 3' splice site form, although its abundance 
was variable (in lanes 6 and 7 this form is only visible in 
the original photograph) . These transcripts are probably 
subject to nonsense-mediated decay and potentially code for 
the same truncated protein . 

Another well-described alternative splicing event 
occurs in the case of the U1-70K gene of Arabidopsis, in 
which transcripts retaining intron 6 are more abundant in 
most tissues (except roots) than the correctly spliced 
transcript (Golovkin and Reddy 1996) . The RT-PCR results 
from control plants (Fig. SB, lanes 1, 9, 10) confirm this 
observation. However, in both atSRp30 transf ormants, either 
the lower expressing pG30 lines (lanes 2 and 3) or the 
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higher expressing pC30 lines (lanes 4-8) , most of the 
transcripts were spliced correctly and only a low level of 
intron retention was observed. This regulatory event could 
potentially influence the level of active protein, 

The most interesting case of alternative splicing 
modulation was found in the case of the long intron (intron 
11) of GatSRp34, which is a close homolog of GatSRp30 (Fig. 
9). In wild-type plants (Fig. 9, lane 1), part of this 
intron is alternatively spliced to generate mRNA5 . However, 
in pGSO transformants (lanes 2-5), mRNAS was the main 
alternative spliced form, whereas the level of mRNAl 
appeared unchanged. Surprisingly, when atSRpSO was highly 
overexpressed, the alternatively spliced mRNAS became the 
main transcript, whereas the amount of mRNAl was 
considerably decreased. Assuming that shorter PGR products 
are probably overrepresented, the reduction of mRNAl may be 
even more drastic. This assumption was confirmed by 
immunoblot analysis of total protein extracts: atSRp34/SRl 
was deleted in control plants (Fig. 9B, lane 3) and in a 
pG30 transformant (lane 5), but not in a pC30 transformant 
(lane 4) . However, a smaller specific protein band 
(atSRp34s, 38 kD) became visible in this case, indicating 
that the protein product of mRNA3 predominates in this 
transformant. These results clearly show that 
overexpression of atSRp30 strongly influences gene 
expression of its close homolog, atSRp34/SRl. 
DISCUSSION 

atSRp30 is a member of the SR protein family 
In addition to their characteristic domains, which 
include one or two RRMs and a carboxy-terminal RS domain 
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with multiple SR dipeptides, SR proteins possess 
phosphoepitope (s) recognized by mAbl04, and they are 
insoluble in the presence of millimolar concentrations of 
magnesium salts. The antibody to atSRpSO recognized a 
complex pattern of bands in the Mg precipitate, whereas in 
total protein fractions no specific band could be detected. 
The failure to detect atSRp30 in crude lysates reflects the 
low abundance of the protein and/or the limited sensitivity 
of the antibody. However, when atSRpSO was overexpressed, 
the antibody detected a specific protein with an apparent 
mobility of 40 kD. In contrast, atSRp34/SR could be 
detected by anti-p34 in total protein fractions- The 
immunoblot data confirm that atSRpBO is a true SR protein, 
as it is present in SR protein preparations and shows 
corresponding immunostained bands with mAbl04. However, as 
complete dephosphorylation could not be achieved, it is not 
known how many of these proteins represent modified forms 
of atSRpBO or closely related proteins. 

Is the expression of atSRpSO autoregulating? 

When atSRpSO pre-mRNA was overexpressed, mRNAS, which 
is generated from use of an alternative 3" splice site in 
the tenth intron, was the main transcript detected, whereas 
the level of mRNAl was enhanced only moderately (Fig. 6) . 
There are several possible explanations for this 
phenomenon. First, there may be a limiting splicing factor 
that is titrated out because of overexpression of the 
atSRp30 gene. Such a factor would have to be specific for 
the tenth intron, as all the other introns in atSRp30 are 
spliced properly. Second, mRNA 3 and mRNAl may be 
synthesized for the most part in different cells, via cell- 
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type-specific alternative splicing. As the constitutive 35S 
CaMV promoter used in the overexpression experiments is 
active in most tissues the large amount of mRNA3 may 
reflect inappropriate expression in cell types in which a 
factor required for correct processing of the tenth intron 
is absent. To clarify this issue, experiments aimed at 
determining the cells in which the alternative splicing 
event takes place will be necessary. 

The fact that the ratio of mRNA3 to mRNAl is constant 
in all transformants overexpressing GatSRpBO (Fig. 6A) is 
consistent with a potential autoregulatory mechanism 
involving atSRpSO. Overproduced atSRpSO protein may 
stimulate the alternative splicing event, thus down- 
regulating its own expression, as has been shown to be the 
case for several animal splicing factors, such as Sxl (Bell 
et al. 1991), tra (Mattox and Baker 1991), and SWAP (Zachar 
et al. 1987) from Drosophila, as well as mouse SRp20 (Jumaa 
and Nielsen 1997) . In agreement with this model, it was not 
possible to detect a protein product from mRNA3 (atSRp30s) . 
Evidence was obtained for an influence of atSRp30 on 
alternative splicing of transcripts from the endogenous 
gene in plants overexpressing the atSRp30 cDNA (Fig. 6B, 
lanes 5-9), resulting in preferential use of an alternative 
5' splice site. Overexpression of atSRp30 might have a 
different effect on the large amount of exogenous pre-mRNA 
present in cells where this protein is not expressed 
normally resulting in preferential use of an alternative 3' 
splice site (Fig. 6A, B, lanes 1-4) . Unfortunately, this 
hypothesis cannot be tested in vitro because no plant 
splicing extracts are available. Future experiments should 
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address whether atSRpSO can indeed autoregulate its own 
production . 

atSRp30 overexpression influences splice-site choice 
in several plant pre-mRNAs and induces changes in plant 
development 

Many in vivo and in vitro experiments have shown that 
human SF2/ASF is involved in the selection of 5' splice 
sites by binding cooperatively to this site (Zuo and Manley 
1994) and to UlsnRNP (Kohtz et al. 1994; Jamison et al. 
1995; Zahler and Roth 1995) . Furthermore, like other SR 
proteins, SF2/ASF influences splice-site selection in part 
by binding to enhancer sequences that are present 
frequently in intronic or exonic regions. Upon binding to 
those elements, the protein activates splicing by 
recruiting the splicing mechanism to an adjacent splice 
site through protein-protein interactions (Wu and Maniatis 
1993; Amrein et al. 1994; Zuo and 

Manley 1994) • Arabidopsis atSRpBO was a good candidate for 
being a splicing modulator, given its similarity to human 
SF2/ASF and its peculiar expression pattern. The ability to 
stably overexpress atSRpBO in whole plants that remained 
viable made it possible to determine, for the first time, 
the effects of increased SR protein levels on alternative 
splicing of specific endogenous transcripts. Some but not 
all endogenous transcripts tested were affected. In the 
case of atSRpSO, pre-mRNA itself, and tSRp34/SRl/pre-mRNA, 
intronic alternative 5' splice sites were activated, 
whereas in atRSp31 and Ul 70K pre-mRNAs, the usage of the 
normal splice sites was generally enhanced but variable. 
There is no explanation for this variability in individual 
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transf ormants (see Fig. 8), as the atSRp30 protein levels 
in several pC30 transf ormants were similar (Fig. 6C) . The 
most pronounced effect of atSRp30 overexpression was on the 
splicing pattern of the close homolog atSRp34/SRl^ in which 
preferential use of the alternative 5' splice site gave 
rise mostly to a truncated mRNA isoform, atSRp34s^ and 
virtually no atSRp34/SRl protein. Although there is no 
information to date concerning the activity of the smaller 
atSRp34s, these results suggest the normal expression of 
atSRp30 protein in the same cells that transcribe 
atSRp34/SRl is important for the synthesis of authentic 
atSRp34/SRl protein. This regulatory loop might explain the 
different expression patterns of both factors in root 
tissue and young seedlings. It remains to be determined 
whether there is a reciprocal effect of atSRp34/SRl 
overexpression on alternative splicing of atSRp30 pre-mRNA. 

Plants overexpressing atSRp30 constitutively and 
ubiquitously showed interesting changes in morphology, as 
well as in several aspects of development. Although some of 
the details will have to be investigated more thoroughly^ 
many of the differences can be explained by changes in 
phase transition and in the definition of cell fates. In 
this work it has been demonstrated that atSRp30 can change 
the expression of other genes drastically by influencing 
their splicing patterns. As the expression pattern of 
atSRpSO is normally very tissue specific, some of the 
observed effects of overexpression might simply be caused 
by its expression in inappropriate tissues, where it 
influences the expression of genes that are not the natural 
targets of this splicing factor. In addition, the observed 
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decrease in the expression of the splicing factor atSRp34 
upon overexpressing atSRp30 may in turn affect the 
expression of other genes, leading to the observed 
phenotypic changes. On the other hand, the changes in 
trichome development resulting in additional branching may 
reflect overexpression of atSRp30 in the supporting cell of 
the trichomes, which is a normal site of atSRpBO 
expression. Therefore, atSRpSO may be a determinant of 
trichome development. Finding the natural targets of 
regulation by atSRpSO remains a critical goal to explain 
the observed changes in phenotype, as well as to promote 
the understanding of developmental pathways in plants. 
Two SF2/ASF-like factors in Arabidopsis 
Immunoblot analysis with mAbl04 has suggested 
previously that the complexity of the SR protein family is 
higher in animals than in plants (Lopato et al. 1996a), as 
several higher molecular mass proteins have been identified 
in SR preparations from mammals but not from those in 
plants. The existence of two SF2/ASF-like factors in 
Arabidopsis may compensate for the absence of orthologs of 
other members of the SR protein family. The similar gene 
structure of atSRpSO and atSRp34/SRl is indicative of an 
ancestral gene duplication event. Interestingly, the 
penultimate long intron is conserved in both genes and is 
involved in alternative splicing events. Judging from 
Northern blot analysis of RNAs from different plant organs, 
the ratio of the various SR protein mRNAs is quite variable 
and is probably reflected in the abundance of the resulting 
proteins. In both genes, the alternative splicing event 
results in predicted proteins with a truncated RS region 
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and a few new additional carboxy-terminal amino acids. 
Whereas a shorter protein of atSRp34/SRl could be detected 
clearly in plants overexpressing atSRp30, atSRp30s was 
never observed in these plants, although the corresponding 
alternatively spliced mRNA is abundant. Expression of 
atSRpSOs cDNA in Escherichia coli yielded stable protein 
(data not shown) . Therefore, either mRNAS is not translated 
efficiently, or it is degraded through a premature nonsense 
codon-mediated decay pathway, or atSRpSOs is preferentially 
recognized by one of the plant protein-degradation systems. 

Comparative analysis of atSRpSO with all known plant 
and animal SR proteins revealed that it is a paralog of 
atSRp34/SRl and that both Arabidopsis proteins are closest 
to human SF2/ASF, The main structural difference is that 
atSRp30 lacks a G-rich region between the two RRMs, which 
may influence the flexibility and perhaps the specificity 
of the RNA-binding region. The notion that the two proteins 
might have distinct activities is strengthened by the 
observation that in many cases their expression patterns 
are quite different, as shown in Fig. 5. In general, 
atSRp30 expression is more confined to specialized cell 
types and tissues, like trichomes, cotyledons, or lateral 
root primordia, suggesting a special role for this protein 
in the initiation of organ formation, whereas atSRp34/SRl 
is more strongly expressed in meristematic tissue. 
Furthermore, the present sequence and immunological data 
indicate that atSRp34/SRl is more similar to hSF2/ASF than 
is atSRp30. Taken together, these data suggest that 
atSRp34/SRl is a more general splicing factor, similar to 
hSF2/ASF, whereas atSRp30 might have more specialized 
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functions, perhaps acting as a regulatory splicing factor 
that modulates alternative splicing and gene expression in 
specific cell types. 
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